Abstract

We propose a new low-density parity-check code construction scheme based on 2-lifts. The proposed codes
have an advantage of admitting efficient hardware implementations. With the motivation of designing codes with
low error floors, we present an analysis of the low-weight stopping set distributions of the proposed codes. Based
on this analysis, we propose design criteria for designing codes with low error floors. Numerical results show that
the resulting codes have low error probabilities over binary erasure channels.

I. Introduction

Low-Density Parity-Check (LDPC) codes have received much attention recently. The advantages of these codes
include capacity-approaching performance for many important channels and highly efficient parallel
decoding algorithms.

Protograph based LDPC codes are a class of LDPC codes constructed by random $N$-lifts of small
Tanner graphs [1]. A major advantage of protograph based LDPC codes is that decoding can be efficiently
implemented by a semi-parallel hardware structure [2]. Several protograph based LDPC code ensembles
with low error probabilities have also been reported previously, for example, [3], [4], [5].

In this paper, we propose a new LDPC code construction scheme based on a series of random 2-lifts.
The proposed code ensembles are subsets of the protograph based code ensembles obtained by $2^n$-lifts, where $n$ is a
positive integer. As with the protograph based codes, decoding of the proposed codes can be efficiently
implemented by the same semi-parallel hardware structure. In addition, the description complexity of the
proposed codes is lower than that of conventional protograph based codes.

It is well known that LDPC codes may exhibit error floors. The error floors are caused by small
subgraphs with certain structures in the corresponding Tanner graphs [6], [7], [8], [9]. One of these
subgraph structures is the so-called stopping set.

In order to understand and predict the performance of LDPC codes, previous research has proposed
several approaches for approximately calculating stopping set distributions and codeword distributions
[10], [11]. These approaches determine the exact values of limiting exponents by large deviation
principles; here, exponents are defined as the logarithms of the numbers of codewords or stopping sets
with certain relative distances, normalized by the blocklengths.

For properly designed LDPC codes, the limiting exponents with respect to blocklengths are typically
negative for low relative distances and zero for vanishing relative distances. In other words, low-weight
stopping sets are typically stopping sets with vanishing relative distances; the number of these low-weight
stopping sets scales sub-exponentially with respect to blocklength. These potentially existing low-weight
stopping sets result in high error floors. It is desirable that these low-weight stopping sets be avoided.

In this paper, we present an analysis of low-weight stopping set distributions of the proposed codes in 
terms of graph expansion properties. We show that low-weight stopping sets can be largely avoided if the 
protograph satisfies certain graph expansion properties. Based on this analysis, we propose design criteria 
for the proposed 2-lift based codes to have low error floors. Several numerical examples of code design 
using the design criteria are presented. The numerical results confirm our theoretical analysis. 

The rest of this paper is organized as follows. In Section II we introduce notation and background.
The proposed code construction scheme is shown in Section III. In Section IV we present the analysis
of low-weight stopping set distributions. Remarks and discussions are presented in Section V. Based on
the above discussions, a set of design criteria is presented. The proof is shown in Section VI. We present
numerical results in Section VII. Conclusions are given in Section VIII.

II. Notation and Preliminaries

A Binary Erasure Channel (BEC) is shown in Fig. 1. The BEC is a model for packet network communications,
where the main impairment is packet loss. The channel receives binary inputs and outputs 0, 1,
or $e$. A transmitted signal is received correctly at the receiver with probability $1 - \epsilon$. With probability
$\epsilon$, the receiver receives an erasure symbol $e$, where the transmitted bit is completely lost. The channel
conditional probabilities are summarized as follows:

$$P(y = 1 \mid x = 1) = P(y = 0 \mid x = 0) = 1 - \epsilon,$$
$$P(y = 0 \mid x = 1) = P(y = 1 \mid x = 0) = 0,$$
$$P(y = e \mid x = 1) = P(y = e \mid x = 0) = \epsilon,$$

where $x$ stands for the transmitted signal, and $y$ stands for the received signal.
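The channel model above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function name `bec_transmit`, the string `"e"` used as the erasure symbol, and the seeded generator are hypothetical choices, not part of the original work.

```python
import random

ERASURE = "e"  # assumed representation of the erasure symbol


def bec_transmit(bits, eps, rng=None):
    """Pass a list of 0/1 bits through a BEC with erasure probability eps.

    Each bit is erased independently with probability eps; a bit that is
    not erased is delivered unchanged (the BEC never flips a bit).
    """
    rng = rng or random.Random()
    return [ERASURE if rng.random() < eps else b for b in bits]


# Example use with an arbitrary seed and erasure probability.
received = bec_transmit([0, 1, 1, 0, 1], eps=0.3, rng=random.Random(1))
print(received)
```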

For LDPC codes transmitted over a BEC, a transmitted codeword cannot be recovered if and only if
the set of erased bits contains a stopping set. A formal definition of a stopping set is given as follows.

Definition A nonempty variable node set $S$ is called a stopping set if each check node in the Tanner
graph is either connected to at least two variable nodes in $S$, or is not connected to any variable node in
$S$.
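The definition can be checked mechanically. The Python sketch below is a minimal illustration, assuming the Tanner graph is stored as a list of check nodes, each given by the list of variable node indices it is connected to; the names `is_stopping_set` and `peel`, and the small example graph, are hypothetical. The second function is a standard iterative (peeling) erasure decoder, included to show that the erasures it cannot resolve form a stopping set.

```python
def is_stopping_set(checks, S):
    """True if S is nonempty and no check node sees exactly one node of S."""
    S = set(S)
    if not S:
        return False
    return all(len(S.intersection(c)) != 1 for c in checks)


def peel(checks, erased):
    """Iterative erasure decoding: return the set of bits that stay erased."""
    erased = set(erased)
    progress = True
    while progress:
        progress = False
        for c in checks:
            hit = erased.intersection(c)
            if len(hit) == 1:  # this check determines its single erased bit
                erased -= hit
                progress = True
    return erased


# Hypothetical toy graph with 4 variable nodes and 3 check nodes.
checks = [[0, 1, 2], [1, 2, 3], [0, 3]]
print(is_stopping_set(checks, {1, 2}))
print(peel(checks, {0, 1, 2}))
```

In this toy graph the erasure pattern {0, 1, 2} contains the stopping set {1, 2}, so the decoder recovers bit 0 and then stalls.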

We will use the following widely adopted graph theoretical notations throughout this paper. Given a
graph $G$, we use the notation $G = (V, E)$ to denote that $V$ is the set of all vertices and $E$ is the set of all
edges. We use $(u, v)$ to denote the undirected edge between two vertices $u$ and $v$. For a set of vertices $S$
in a graph $G = (V, E)$, we denote the neighborhood of $S$ by $N(S)$,

$$N(S) = \{u : (u, v) \in E,\ v \in S\}.$$

We use $|S|$ to denote the cardinality of a set $S$. We say that a graph $G' = (V', E')$ is a subgraph of
$G = (V, E)$ induced by $V'$ if $V' \subset V$ and $E'$ is the set of edges in $E$ joining two vertices in $V'$.

We will need the following graph expansion properties in our later discussions. Let G denote a Tanner 
graph. 

Definition We say that a variable node set $S$ in $G$ has the $\beta$ vertex expansion property if $|N(S)| > \beta |S|$.

Definition Let $S$ denote a variable node set in $G$. Let $N(B)$ denote the set of check nodes connected to
nodes in a variable node set $B$ at least twice. We say that $S$ has the $(\gamma, \eta)$ uniform-edge-distribution property
if for each subset $B \subset S$ with $|B| < \eta |S|$, we have

$$|N(B)| < \gamma |B|. \tag{1}$$
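For small graphs, both properties can be tested by direct enumeration. The Python sketch below is an illustration only: it assumes the Tanner graph is a list of check nodes, each given as a list of variable node indices, it uses the strict inequalities exactly as printed above, and all function names are hypothetical. The second test enumerates all subsets of the set, so it is exponential in the set size and only suitable for toy examples.

```python
from itertools import combinations


def neighbors(checks, S):
    """Indices of check nodes connected to at least one node of S."""
    S = set(S)
    return {i for i, c in enumerate(checks) if S.intersection(c)}


def twice_neighbors(checks, B):
    """Indices of check nodes connected to nodes of B at least twice."""
    B = set(B)
    return {i for i, c in enumerate(checks) if len(B.intersection(c)) >= 2}


def has_vertex_expansion(checks, S, beta):
    """beta vertex expansion: |N(S)| > beta * |S|."""
    return len(neighbors(checks, S)) > beta * len(set(S))


def has_uniform_edge_distribution(checks, S, gamma, eta):
    """(gamma, eta) property, checked by brute force over subsets B of S."""
    S = sorted(set(S))
    for size in range(1, len(S) + 1):
        if not size < eta * len(S):
            break  # larger subsets are not constrained by the definition
        for B in combinations(S, size):
            if not len(twice_neighbors(checks, B)) < gamma * size:
                return False
    return True


# Hypothetical toy graph with 4 variable nodes and 3 check nodes.
checks = [[0, 1, 2], [1, 2, 3], [0, 3]]
print(has_vertex_expansion(checks, {1, 2}, beta=0.9))
print(has_uniform_edge_distribution(checks, {0, 1, 2, 3}, gamma=1.5, eta=0.75))
```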

We use $\lfloor \cdot \rfloor$ and $\lceil \cdot \rceil$ to denote the floor function and the ceiling function respectively. That is, $\lfloor x \rfloor$
is the largest integer not greater than a real number $x$, and $\lceil x \rceil$ is the smallest integer not less than the real
number $x$. We use $H_e(x)$ to denote the entropy function with base $e$,

$$H_e(x) = -x \log_e(x) - (1 - x) \log_e(1 - x),$$

where $0 < x < 1$. We use $H_2(x)$ to denote the entropy function with base 2. We denote the set difference
of two sets $A$ and $B$ as $A \setminus B$,

$$A \setminus B = \{u \mid u \in A,\ u \notin B\}.$$

Throughout this paper, we assume all graphs are simple graphs; that is, the graphs do not contain loops
or parallel edges.



III. Code Construction Scheme 

In this section, we present the proposed code construction scheme. We need the following definition
of $N$-lift graphs. A similar definition can also be found in [1].

Definition Given a graph $G = (V, E)$, a graph $\hat{G} = (\hat{V}, \hat{E})$ is said to be an $N$-lift of $G$ if the following
conditions hold.

• The vertex set $\hat{V}$ is the union of disjoint vertex sets $\hat{V}_1, \ldots, \hat{V}_N$, where $\hat{V}_1, \ldots, \hat{V}_N$ have the same
cardinality as $V$. That is,

$$\hat{V} = \hat{V}_1 \cup \cdots \cup \hat{V}_N,$$

$$|V| = |\hat{V}_1| = \cdots = |\hat{V}_N|.$$

• There exists a covering map $\pi : \hat{V}_1 \cup \cdots \cup \hat{V}_N \to V$.

• Given an edge $(u, v)$ in the graph $G$, if a vertex $u_1 \in \pi^{-1}(u) \subset \hat{V}$, then $u_1$ is connected to exactly
one vertex in $\pi^{-1}(v)$.

We call the graph $G$ the base graph of $\hat{G}$, and the map $\pi(\cdot)$ the covering map from $\hat{G}$ to $G$.

An $N$-lift can also be visualized as an operation of copying the base graph $N$ times and permuting edges. Two examples
of 2-lift graphs are shown in Fig. 2. The 2-lift graph shown in sub-figure (a) consists of two identical
copies of the base graph without edge permutation. The 2-lift graph shown in sub-figure (b) consists of
two copies of the base graph with one pair of edges permuted. In the sequel, we say that a graph $\hat{G}$
is a uniformly random 2-lift of a graph $G$ if $\hat{G}$ is chosen from all 2-lifts of $G$ with equal
probability.

The Proposed Code Construction Scheme: A small Tanner graph is first constructed by computational
search or algebraic construction. We denote this Tanner graph by $G_0$ and call it the protograph.
The Tanner graph of the resulting code is obtained by recursively applying the uniformly random 2-lift
procedure several times. That is, we construct a sequence of Tanner graphs $G_1, \ldots, G_k, \ldots, G_M$ such
that $G_k$ is a uniformly random 2-lift of $G_{k-1}$, for $k = 1, \ldots, M$. The graph $G_M$ is the Tanner graph of
the resulting code.
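The recursive construction can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the Tanner graph is stored as a list of (check, variable) index pairs, a uniformly random 2-lift is drawn by deciding independently and with probability 1/2, for every base edge, whether its two copies are crossed, and the names `random_2_lift` and `lift_repeatedly` are hypothetical.

```python
import random


def random_2_lift(edges, n_checks, n_vars, rng):
    """Return a random 2-lift of a Tanner graph given as (check, var) pairs.

    Copy 0 of a node keeps its index; copy 1 is offset by the node count.
    For each base edge a fair coin decides whether the two copies of the
    edge are straight (copy i to copy i) or crossed (copy i to copy 1 - i).
    """
    lifted = []
    for c, v in edges:
        swap = rng.randrange(2)
        for copy in (0, 1):
            lifted.append((c + copy * n_checks, v + (copy ^ swap) * n_vars))
    return lifted, 2 * n_checks, 2 * n_vars


def lift_repeatedly(edges, n_checks, n_vars, M, seed=0):
    """Apply M successive random 2-lifts, starting from the protograph."""
    rng = random.Random(seed)
    for _ in range(M):
        edges, n_checks, n_vars = random_2_lift(edges, n_checks, n_vars, rng)
    return edges, n_checks, n_vars


# Hypothetical protograph: 2 check nodes, 3 variable nodes, fully connected.
proto = [(c, v) for c in range(2) for v in range(3)]
edges, n_checks, n_vars = lift_repeatedly(proto, 2, 3, M=3, seed=1)
print(n_checks, n_vars, len(edges))
```

Every lift doubles the number of nodes and edges and preserves all node degrees, so after M lifts the blocklength is the protograph size multiplied by 2 to the power M.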

IV. Low-Weight Stopping Set Distribution

In this section, we present an analysis of the distributions of low-weight stopping sets. We will
need the following notation.

• As in Section III, we denote the sequence of Tanner graphs obtained in the code construction
scheme by $G_0, G_1, \ldots, G_M$.

• Denote the covering map from $G_k$ to $G_{k-1}$ by $\pi_k(\cdot)$.

• Denote the number of variable nodes in the protograph by $n_0$.

• Denote the minimal weight of stopping sets in the protograph by $d_0$,

$$d_0 = \min_{S \text{ is a stopping set}} |S|. \tag{2}$$

Proposition 4.1: Let $G$, $\hat{G}$ be two Tanner graphs, where the graph $\hat{G}$ is an $N$-lift of $G$. Let $\pi$ denote the
covering map from $\hat{G}$ to $G$. If $\hat{S}$ is a stopping set in the $N$-lift $\hat{G}$, then the set $S = \pi(\hat{S})$ is a stopping
set in the base graph $G$.

Corollary 4.2: In the proposed code construction scheme, the minimal weight of stopping sets in $G_M$
is greater than or equal to $d_0$.
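The statement of the corollary can be checked by brute force on a toy example. The Python sketch below is an illustration under assumptions: the Tanner graph is a list of check nodes, each a list of variable node indices, the base graph is a hypothetical 3-cycle whose only stopping set is the full variable node set, and the function names are not from the original work. The search is exponential and only meant for very small graphs.

```python
import random
from itertools import combinations


def min_stopping_weight(checks, n_vars):
    """Smallest size of a nonempty stopping set, found by exhaustive search."""
    for w in range(1, n_vars + 1):
        for S in combinations(range(n_vars), w):
            s = set(S)
            if all(len(s.intersection(c)) != 1 for c in checks):
                return w
    return None


def random_2_lift(checks, n_vars, rng):
    """Random 2-lift: each edge is kept straight or crossed by a fair coin."""
    first, second = [], []
    for c in checks:
        a, b = [], []
        for v in c:
            if rng.randrange(2):
                a.append(v + n_vars)
                b.append(v)
            else:
                a.append(v)
                b.append(v + n_vars)
        first.append(a)
        second.append(b)
    return first + second, 2 * n_vars


base = [[0, 1], [1, 2], [2, 0]]  # hypothetical base graph
d0 = min_stopping_weight(base, 3)
for seed in range(5):
    lifted, n = random_2_lift(base, 3, random.Random(seed))
    print(seed, d0, min_stopping_weight(lifted, n))
```

In every run the minimal stopping set weight of the lifted graph is at least that of the base graph.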

Condition A: Let $\beta$, $\gamma$, $\eta$, $d$ be real numbers, $0 < \beta$, $0 < \gamma < 1$, and $d > 0$. We say that a set $S$ in a
Tanner graph $G$ satisfies Condition (A) with respect to $\beta$, $\gamma$, $\eta$, $d$ if the following conditions hold.

• The set $S$ has the $\beta$ vertex expansion property.

• The set $S$ has the $(\gamma, \eta)$ uniform-edge-distribution property.

• The neighboring check nodes $N(S)$ of $S$ in the graph $G$ have an average degree less than $d$ in the
subgraph induced by $S \cup N(S)$.

In the sequel, we say that a variable node set has strong graph expansion properties if the set satisfies
Condition (A) with a large $\beta$ and a small $\gamma$.

Theorem 4.3: Let $S$ be a stopping set in the Tanner graph $G_k$. Assume that the set $S$ satisfies Condition
(A) with respect to $\beta$, $\gamma$, $\eta$, $d$. Let $\theta$ be a real number such that $\theta |S|$ is an integer, $\theta > 1 - \eta$. Let $\mathcal{M}(S, \theta)$
denote the number of stopping sets $\hat{S}$ in $G_{k+1}$ with $|\hat{S}| = (1 + \theta)|S|$, $\pi_{k+1}(\hat{S}) = S$. Then, the expectation
$E[\mathcal{M}(S, \theta)]$ is upper bounded as follows.

$$E[\mathcal{M}(S, \theta)] \le \exp\left\{ \left[ H_e(\theta) + (1 - \theta) \log_e 2 - 2(\beta - \gamma\theta) \left(\frac{1}{2}\right)^{\beta d/(\beta - \gamma\theta)} \right] |S| \right\}. \tag{3}$$
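To get a feeling for the bound, the exponent per element of $S$ can be evaluated numerically. The Python sketch below reads the exponent in (3) as $H_e(\theta) + (1 - \theta)\log_e 2 - 2(\beta - \gamma\theta)(1/2)^{\beta d/(\beta - \gamma\theta)}$; the function names and the parameter values in the example are hypothetical and chosen only for illustration.

```python
import math


def entropy_e(x):
    """Entropy function with base e, extended by 0 at x = 0 and x = 1."""
    if x in (0.0, 1.0):
        return 0.0
    return -x * math.log(x) - (1 - x) * math.log(1 - x)


def bound_exponent(theta, beta, gamma, d):
    """Exponent of the bound in (3), divided by |S|."""
    margin = beta - gamma * theta
    if margin <= 0:
        raise ValueError("the bound requires beta - gamma * theta > 0")
    penalty = 2 * margin * 0.5 ** (beta * d / margin)
    return entropy_e(theta) + (1 - theta) * math.log(2) - penalty


# Illustrative parameters only; they are not taken from the paper.
for theta in (0.25, 0.5, 0.75, 1.0):
    print(theta, bound_exponent(theta, beta=2.0, gamma=1.0, d=2.0))
```

A negative value means that the expected number of such stopping sets decays exponentially in $|S|$.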



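Corollary 4.4: Let $\theta_0$ be a positive real number such that $\theta_0 < 1 - \eta$, and for all $0 < \theta < \theta_0$,

$$H_e(\theta) + (1 - \theta) \log_e 2 - 2(\beta - \gamma\theta) \left(\frac{1}{2}\right)^{\beta d/(\beta - \gamma\theta)} < -\Delta, \tag{4}$$

where $\Delta$ is a positive real number. Then, with high probability, there is no stopping set $\hat{S}$ with $\pi_{k+1}(\hat{S}) = S$,
$|\hat{S}| < (1 + \theta_0)|S|$ in $G_{k+1}$, if $|S|$ is sufficiently large.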

Lemma 4.5: Let $S$ be a stopping set in the graph $G_k$. Assume that $S$ satisfies Condition (A) with
respect to $\beta$, $\gamma$, $\eta$, $d$. Let $\hat{S}$ be a stopping set in the graph $G_{k+1}$ with $\pi_{k+1}(\hat{S}) = S$, $|\hat{S}| = (1 + \theta)|S|$,
$\theta > \theta_0$. Then, $\hat{S}$ satisfies Condition (A) with respect to $\beta'$, $\gamma'$, $\eta'$, $d'$, where

$$\beta' > \min_{\theta_0 < \theta < 1} \frac{2\beta - \gamma(1 - \theta)}{1 + \theta},$$

7'<^7, r^' = 2(1-9,), d^<^ (5) 

For the special case where $G_0, \ldots, G_k$ are $(d_v, d_c)$ regular LDPC codes, $\hat{S}$ satisfies Condition (A) with
respect to $\beta'$, $\gamma'$, $\eta'$, $d'$, where

> „i„ (6, 

6»o<6»<l 1 + 9 

In this special case, the following relationships always hold: 

7'<y, r^' = 0, d^<|. (7) 

V. Remark and Discussion 

Remark 1: Proposition 4.1 and Corollary 4.2 show that the minimal weight of stopping sets in the
resulting Tanner graph is never less than the minimal weight of stopping sets in the protograph.
Hence, it is desirable that the protograph has a large minimal weight of stopping sets.

Remark 2: According to Theorem 4.3 and Corollary 4.4, with high probability, there are no stopping
sets $\hat{S}$ with $\pi_{k+1}(\hat{S}) = S$ and weight less than $(1 + \theta_0)|S|$. In order to have a large $\theta_0$, $d$ and $\gamma$ should
be small and $\beta$ should be large. The above discussion implies that stopping sets with strong graph expansion properties
are less problematic in terms of introducing low-weight stopping sets. Hence, the protograph $G_0$ in the
code construction scheme should be chosen so that stopping sets with weak graph expansion properties
are avoided.

Remark 3: Lemma 4.5 implies that, with high probability, the stopping sets $\hat{S}$ with $\pi_{k+1}(\hat{S}) = S$ also
have strong graph expansion properties. Lemma 4.5 presents a worst-case analysis. We conjecture that $\hat{S}$
has much stronger graph expansion properties in an average-case analysis.

Remark 4: In hardware implementations, Tanner graphs need to be stored. It can be checked that $O(m)$
bits are needed to store a Tanner graph of the proposed codes, where $m$ is the number of edges. For general
randomly constructed LDPC codes, $\log_2(m!) \approx O(m \log_2(m))$ bits are needed. The description complexity
of the proposed codes is much lower.
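The two storage costs can be compared numerically. The Python sketch below rests on an assumption that is not spelled out in the text: each 2-lift step is described by one bit per edge of the graph being lifted (copies straight or crossed), and the protograph itself is stored separately. The function names and the example sizes are hypothetical.

```python
import math


def two_lift_description_bits(m0, M):
    """Bits for the lift choices: one bit per edge of G_0, ..., G_{M-1}.

    m0 is the number of edges of the protograph; graph G_k has m0 * 2**k
    edges, so the total is m0 * (2**M - 1), which is below m = m0 * 2**M.
    """
    return sum(m0 * 2 ** k for k in range(M))


def random_graph_description_bits(m):
    """log2(m!), the cost of describing an arbitrary permutation of m edges."""
    return math.lgamma(m + 1) / math.log(2)


m0, M = 100, 8  # illustrative values only
m = m0 * 2 ** M
print(two_lift_description_bits(m0, M), round(random_graph_description_bits(m)))
```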

Based on the above discussion, we propose the following design criteria for designing codes with large 
stopping distance: 

• the protograph should have a large minimal weight of stopping sets; 

• the protograph should not contain low-weight stopping sets with weak graph expansion properties.



VI. Proof of Theorem 4.3

We first consider a randomly generated variable node set $\hat{S}$ in the graph $G_{k+1}$, chosen from all variable
node sets $A$ with $\pi_{k+1}(A) = S$, $|A| = (1 + \theta)|S|$ with equal probability. Let $E$ denote the random event that
the set $\hat{S}$ is a stopping set. We can bound the probability of this random event as follows.

$$\begin{aligned}
P(E) &\overset{(a)}{=} \sum_{G} P(G_{k+1} = G)\, P(E \mid G_{k+1} = G) \\
&\overset{(b)}{=} \sum_{G} P(G_{k+1} = G)\, \frac{n_1(G)}{n_2(G)} \\
&\overset{(c)}{\ge} \sum_{G} P(G_{k+1} = G)\, \frac{n_1(G)}{\exp\{[H_e(\theta) + \log_e(2)(1 - \theta)]|S|\}} \\
&= \exp\{-|S| H_e(\theta) - (\log_e 2)(1 - \theta)|S|\}\, E[\mathcal{M}(S, \theta)].
\end{aligned} \tag{8}$$

In the above inequality, (a) follows from the total probability theorem, where $G$ denotes a realization of
the random graph $G_{k+1}$; (b) follows from a simple counting argument on the discrete probability space,
where $n_1(G)$ denotes the number of stopping sets $A$ in $G$ with $\pi_{k+1}(A) = S$, $|A| = (1 + \theta)|S|$, and $n_2(G)$
denotes the number of variable node sets $A$ in $G$ with $\pi_{k+1}(A) = S$, $|A| = (1 + \theta)|S|$; (c) follows from
the fact that the number of variable node sets $A$ with $\pi_{k+1}(A) = S$ and cardinality $(1 + \theta)|S|$ is

$$\binom{|S|}{\theta |S|} 2^{(1 - \theta)|S|} \tag{9}$$

and the Stirling approximation [12, page 284]

$$\frac{2^{n H_2(k/n)}}{n + 1} \le \binom{n}{k} \le 2^{n H_2(k/n)}. \tag{10}$$
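The bounds on the binomial coefficient used in step (c), namely $2^{nH_2(k/n)}/(n+1) \le \binom{n}{k} \le 2^{nH_2(k/n)}$, can be checked numerically. The following Python sketch is only a sanity check with hypothetical function names; it compares both bounds with the exact binomial coefficient for one small value of $n$.

```python
import math


def h2(x):
    """Entropy function with base 2, extended by 0 at x = 0 and x = 1."""
    if x in (0, 1):
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def binomial_bounds(n, k):
    """Return (lower, upper) with lower <= C(n, k) <= upper."""
    upper = 2 ** (n * h2(k / n))
    return upper / (n + 1), upper


n = 20
for k in (0, 5, 10, 20):
    lower, upper = binomial_bounds(n, k)
    print(k, lower, math.comb(n, k), upper)
```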

Given the variable node set $\hat{S}$, let $S_1$ denote the set of variable nodes $u$ in $S$ with $\pi_{k+1}^{-1}(u) \subset \hat{S}$. Let
$S_0 = S \setminus S_1$. It is clear that $|S_1| = \theta |S|$, $|S_0| = (1 - \theta)|S|$. Let the set $C$ denote the set of check nodes
$c$ such that $c$ is in $G_k$, and $c$ is connected to $S_1$ at most once. We further consider an arbitrarily chosen
check node set $\hat{C}$ in $G_{k+1}$ such that $\pi_{k+1}(\hat{C}) = C$, $|\hat{C}| = |C|$. It is clear that for each pair of check nodes
$c_1, c_2 \in \pi_{k+1}^{-1}(c)$ with $c \in C$, exactly one of $c_1$, $c_2$ is in $\hat{C}$. By the hypothesis,

$$|C| > (\beta - \gamma\theta)|S|. \tag{11}$$

Let $m_j$ denote the number of check nodes $c \in \hat{C}$ such that $\pi_{k+1}(c)$ is not connected to any variable node
in $S_1$, and $\pi_{k+1}(c)$ has degree $j$ in the subgraph induced by the variable node set $S$ and its neighboring check node
set $N(S)$. Let $n_j$ denote the number of check nodes $c \in \hat{C}$ such that $\pi_{k+1}(c)$ is connected to exactly one
variable node in $S_1$, and $\pi_{k+1}(c)$ has degree $j$ in the subgraph induced by the variable node set $S$ and its neighboring
check node set $N(S)$.

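The probability that the set $\hat{S}$ is a stopping set can be upper bounded as follows.

$$\begin{aligned}
P(E) &\overset{(a)}{\le} \prod_{j} \left[1 - j\left(\tfrac{1}{2}\right)^{j}\right]^{m_j} \left[1 - \left(\tfrac{1}{2}\right)^{j-1}\right]^{n_j} \\
&= \exp\left\{\sum_{j} \left[m_j \log_e\left(1 - j\left(\tfrac{1}{2}\right)^{j}\right) + n_j \log_e\left(1 - \left(\tfrac{1}{2}\right)^{j-1}\right)\right]\right\} \\
&\overset{(b)}{\le} \exp\left\{-\sum_{j} \left[j\left(\tfrac{1}{2}\right)^{j} m_j + \left(\tfrac{1}{2}\right)^{j-1} n_j\right]\right\} \\
&\overset{(c)}{\le} \exp\left\{-2\sum_{j} (m_j + n_j)\left(\tfrac{1}{2}\right)^{j}\right\} \\
&\overset{(d)}{\le} \exp\left\{-2|\hat{C}|\left(\tfrac{1}{2}\right)^{\beta d/(\beta - \gamma\theta)}\right\} \\
&\overset{(e)}{\le} \exp\left\{-2(\beta - \gamma\theta)|S|\left(\tfrac{1}{2}\right)^{\beta d/(\beta - \gamma\theta)}\right\}.
\end{aligned} \tag{12}$$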



The right hand side of (a) is the probability that the check node set $\hat{C}$ does not contain any check node
that is connected to $\hat{S}$ exactly once. The inequality (b) follows from the inequality $\log_e(1 - x) \le -x$, for
$0 < x < 1$. The inequality (c) follows from the fact that the degree $j$ is always greater than or equal to
2. The inequality (d) follows from the convexity of exponential functions and the fact that the average of the degree
$j$ is less than $\beta d/(\beta - \gamma\theta)$. The inequality (e) follows from Ineq. (11).
The theorem follows from Ineqs. (8) and (12).

VII. Numerical Results

In this section, simulation results for the proposed 2-lift based LDPC codes are presented. We investigate
both codes with algebraically constructed protographs and codes with protographs obtained by
computational search.

We construct a (3,4) regular code with blocklength 992 bits and rate 0.27. The protograph is chosen as
an LDPC code previously reported in [13]. The protograph is a (3,4)-regular code with blocklength 124,
dimension 33, and minimal distance 24. We find by exhaustive search that the protograph does not contain small
stopping sets. The parity-check matrix corresponding to the protograph is as follows:



H 



h ho ho h 
hb h9 h hi 



where $I_k$ represents the $k$-times cyclically right-shifted identity matrix of size $31 \times 31$.
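A cyclically shifted identity block of this kind can be generated as in the Python sketch below. The shift convention is an assumption made for the illustration (row $i$ has its single 1 in column $(i + k) \bmod 31$), and the function name `shifted_identity` is hypothetical; the shift values of the actual protograph are not reproduced here.

```python
def shifted_identity(k, size=31):
    """Identity matrix of the given size with its columns cyclically shifted.

    Row i has its single 1 in column (i + k) mod size, which is the assumed
    meaning of a k-times cyclic right shift.
    """
    return [[1 if col == (row + k) % size else 0 for col in range(size)]
            for row in range(size)]


# A small 3 x 3 example with shift 1.
for row in shifted_identity(1, size=3):
    print(row)
```

A block parity-check matrix is then obtained by placing such blocks side by side and on top of each other; every block contributes exactly one 1 to each of its rows and columns.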

We depict the error probability performance of the above 2-lift based code in Figs. 3 and 4. The
code is compared with a random (3, 4) regular LDPC code. The channels are BECs with various channel
parameters $\epsilon$. The error probabilities of the proposed code are represented by solid curves, while the error
probabilities of the random (3,4) regular code are represented by dashed curves. The two codes have
almost identical error probability performance.

We also construct codes with protographs obtained by computational search. We show the parity-check
matrix of a protograph in Fig. 5. The protograph does not contain low-weight stopping sets with
weak graph expansion properties. The protograph contains 32 variable nodes and 16 check nodes. The
degree distributions of the protograph are chosen to approximate the degree distributions in Eqs. (13) and (14),
which are obtained by density evolution [14]. The blocklength of the resulting code is 8192. The rate is
0.5.

A(x) = 0.1896a; + 0.4806^^ + 0.1032a;" + 0.2266a;^^ (13) 

p{x) = 0.8693^*^ + 0.1307a;^ (14) 

We depict the error probability performance of the above 2-lift based code in Figs. 6 and 7. The code
is compared with a random irregular code with the degree distributions in Eqs. (13) and (14) and a random (3, 6)
regular LDPC code. The channels are BECs with various channel parameters $\epsilon$. The 2-lift based code has a
waterfall position close to that of the random irregular code. However, unlike the random irregular code, the
proposed code does not exhibit an error floor. The 2-lift based code outperforms the random (3, 6) regular
code in terms of error probability performance, because it has a better waterfall position.

VIII. Conclusion 

This paper proposes a new construction scheme for LDPC codes based on random 2-lifts. We present
an analysis for low-weight stopping set distributions. Based on this analysis, we propose design criteria for 
the resulting codes to have low error floors. Numerical results show that the proposed codes outperform 
the previously constructed codes in terms of error probability performance. 
Fig. 1. Binary erasure channel



(a) Two copies of the base graph

(b) A 2-lift with one edge permutation

Fig. 2. Examples of graph lift
Fig. 3. Bit error probability of a 2-lift based code with an algebraically constructed protograph. The channels are BECs with various
channel parameters. 




Fig. 4. Block error probability of a 2-lift based code with an algebraically constructed protograph. The channels are BECs with various 
channel parameters. 
Fig. 5. The parity-check matrix corresponding to a protograph obtained by computational search. The rows correspond to variable nodes.
The columns correspond to parity-check equations.
Fig. 6. Bit error probability of a 2-lift based code over BECs.

Fig. 7. Block error probability of a 2-lift based code over BECs.



